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Description 

The purpose of the present invention is to develop a vaccine to prevent acquired innmunodeficiency 
syndrome (AIDS) based partly or solely on synthetic peptides which produde T cell immunity. T cell 
5 immunity is an important arm of defense against viral infections, but has hardly been studied for AIDS. 
Helper T cells are needed for an antibody response as well as for a cytotoxic T cell response and for 
inducing macrophage and LAK cell killing. While it is not yet clear whether cellular immunity or humoral 
immunity is the critical element in protection against a virus, the peptides of the present invention are 
particularly suited for use in a vaccine capable of safely eliciting either type of Immunity: (1) the peptides 

70 are synthetically produced, and therefore do not include live virus, or a part of a live virus, or other sites 
which might produce deleterious effects (e.g., the site binding to T4 or the site producing cell fusion); (2) 
the peptides may be used alone to induce cellular immunity; (3) the peptides may be used in conjunction 
with other molecules in order to induce antibody production or a humoral response; and (4) the peptides 
may be targeted for a particular type of T-cell response without the side effects of other unwanted 

15 responses. ) 

Much effort has been devoted to the analysis of antibodies to AIDS virus antigens, but no previous 
studies have defined antigenic sites of this virus which elicit T cell immunity, even though such immunity is 
important in protection against many other viruses. Analysis of immunodominant helper T cell sites has 
suggested that such sites tend to form amphipathic helices. Using an algorithm based on this model, two 

20 candidate T cell sites, env T1 and env T2, were identified in the human T cell lymphotropic virus type I lib 
(HTLV-lllb) envelope protein that were conserved in other Human Immunodeficiency Virus (HIV) isolates. 
Corresponding peptides were synthesized and studied in genetically defined inbred and Fi mice for 
induction of lymph node proliferation. After immunization with a 425 residue recombinant envelope protein 
fragment, significant responses to native gp1 20 as well as to each peptide were observed in both Fi 

25 combinations studied. Conversely, Immunization with env T1 peptide induced T cell immunity to the native 
gp120 envelope protein. The genetics of the response to env T1 peptide were further examined and 
revealed a significant response in three of four independent Major Histocompatibility haplotypes tested, an 
indication of high frequency responsiveness in the population. 

Identification of helper T cell sites should facilitate development of a highly immunogenic, carrier-free 

30 vaccine that induces both T cell and B cell immunity. The ability to elicit T cell immunity to the native AIDS 
viral protein by immunization with a 1 6-residue peptide suggests that such sites represent potentially 
important components of an effective AIDS vaccine. 

Since the discovery of the human immunodeficiency viruses, the causative agents of the acquired 
immunodeficiency syndrome (AIDS), substantial progress has been made toward characterizing the viral 

35 genes and their products in infected cells. Though responsible for profound immunodeficiency, viral 
infection in man consistently induces a detectable immune response as evidenced by serum antibodies to 
the major viral proteins. Studies of serum reactivity to specific viral proteins have revealed no consistent 
prognostic associations to date. Much of the host antibody response is focused on the envelope proteins 
gp120 and gp41. The ability of native gp120 or a large recombinant fragment to induce neutralizing 

40 antibodies has been demonstrated. Two apparently immunodominant antibody binding sites in the gp41 
envelope protein have been defined at the level of small synthetic peptides. Though such antibody sites are 
clearly of diagnostic importance and potentially of importance for vaccine design, the typical progression of 
AIDS in patients despite the presence of these antibodies suggests that effective T cell immunity is 
important to the immune defense against this pathogen. 

45 An ideal vaccine is highly immunogenic, induces both T cell and B cell virus-specific immunity, and is 
free of irrelevant carrier proteins. While traditional approaches using whole virion or virion subunits can 
generally achieve this, practical considerations such as safety and availability of native antigen have led 
many to consider more highly engineered vaccine constructs for AIDS. Localization of immunodominant T 
cell and 8 cell recognition sites becomes critical if one wishes to design a vaccine based on recombinant 

50 proteins or synthetic peptides. A T cell repsonse to the gp120 envelope protein has been demonstrated 
recently by Zarling et al. Nature , 323: 344-345 (1986) in macaques immunized with vaccinia constructs 
containing gp120 coding sequence. However, identification and characterization of immunodominant T cell 
sites within this 518 residue protein or other HIV proteins have not b en reported. 

Antibodies typically r cognize fr e antigen in nativ conformation and can potentially r cognize almost 

55 any site exposed on the antigen surfac . In contrast, typical CD4+ helper T cells r cognize antigen only in 
the context of the Class II Major Histocompatibility (MHC) molecule and only aft r appropriate antigen 
processing usually consisting of proteolysis or d naturation. Additionally, the polyclonal T cell response is 
focused only on relatively few discrete sit s. This limited r spons is seen ven for non ukaryotic proteins 

3 
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(e.g., influenza hemagglutinin and staphylococcal nuclease) for which toleranc to homologous host proteins 
does not limit the number of antigenic sites. Therefore, it is important to find sites which do elicit T-cell 
immunity to AIDS viral proteins. The elucidation of features determining immunodominance residing both 
intrinsic and extrinsic to antigen is the focus of much current basic and clinical interest. Detailed 

5 characterization of immunodominant T ceils sites has allowed exploration for general features. Such analysis 
led to the observation that Immunodominant T cell sites tend to have an amino acid sequence consistent 
with formation of an amphipathic helix with hydrophilic residues on one face and hydrophobic residues on 
the opposite face. In an amphipathic alpha helix, the hydrophobicity varies sinusoidally with a period of 3.6 
residues per turn of the helix or a frequency of 100* per residue. An amphipathic 3io helix has a period of 

10 3 and a frequency of 120'. Based on this model, an algorithm entitled AMPHI has been developed for 
identification of such sequences In proteins given only primary sequence data. 

Although Zarling et al., Nature 323: 344 (1986), showed that T cell immunity to the AIDS virus envelope 
could be induced in monkeys using a recombinant vaccinia virus carrying the gene for the whole envelope 
protein, their study did not identify antigenic sites stimulating these T cells. Also, vaccinia immunization has 

15 been discontinued in the United States because of danger of disseminated vaccinia infection and other side 
effects, A synthetic peptide vaccine would not carry any of these risks. Since it would be synthetic, there 
would be no risk of live AIDS virus contamination as might occur with a killed virus vaccine. A synthetic 
peptide which does not contain sites responsible for syncytia formation might produce fewer side effects 
than a large recombinant envelope protein containing such sites. 

20 

Summary of the Invention 



The present invention relates to a method of inducing cellular immunity to native proteins of the AIDS 
virus by Immunization with short synthetic peptides. Five potential peptides are identified by searching for 

25 regions which can fold as a maximally amphipathic helix. Two of these are recognized by T cells immune to 
the AIDS virus envelope protein. One of these was used to immunize mice and can successfully induce T 
cell immunity to the whole AIDS envelope protein (480 amino acid residues long) using this 16-residue 
peptide in 3 of 4 mouse strains tested. The Invention includes but is not limited to these specific peptides 
and any synthetic peptides containing or overlapping these sequences or variations therein, including 

30 peptides with amino acid substitutions that retain or enhance the activity documented. These can be used 
alone as a vaccine or in combination with other materials (e.g. primary immunization with peptide, boost 
with recombinant fragment). They can also be attached to sites binding neutralizing antibodies to induce a 
neutralizing antibody response. 

Specifically according to the invention there is provided isolated peptides or portions thereof adapted to 

35 serve as components of a vacine for acquired immune deficiency syndrome (AIDS), characterised in that 
said peptides are capable of inducing T cell response specific to acquired Immunodeficiency syndrome 
(AIDS) virus and in that said peptides are selected from amino acid sequences in single letter code: 



^0 FNMWKNDMVEQMHEDIISLWDQSLKPCVKLTLCV 
KTFNGTGPCTNVSTVQCTHG; IRIQRGPGRAFVTIGKIGNMRQAHC 
RAKWNNTLKQIDSKLREQFGNNK; TLPCRIKQIINMWQEVGKAMYAP 
NNLLRAIEAQQHLLQLTVWGIK 
SNKSLEQIWNNMTWMEWDREIMNYTSLIHSLIEESQNQ 
45 ELLELDKWA5LWNWFNITNWLWY 
IVTRIVELLGRRGWEALKYWWNLLQYWSQELK 
DRVIEWQGAYRAIRHIPRRIRQGLERILL ; ASVLSGGELDRWEKIRLR 
ETSEGCRQILGQLQPSLQT; EIKDTKEALDKI EEEQKK , 

SPRTLNAWVKVVEEK; DIRQGPKEPFRDYVDRFYKTLRAEQA ; 
ARVLAEAMSQVTNTATI; HEDI I SLWDQSLK ; and 
KQIINMWQEVGKAMYA; and 



50 



T-cell sites env Ti , residues 428-443, and env T2, residues 112-124, or portions thereof. 

The present invention is critical to the manufactur of peptide vaccines capable of eliciting T-cell 
55 immunity. One aspect of the present invention is the discovery of certain traits which seem to be common 
to most T-cell stimulating protein segments. Such peptide vaccines should optimally be those protein 
segments which (a) have a propensity to form amphipathic alpha-helices; (b) do not have regions with a 
propensity to coil formations; and (c) have a lysine at their COOH-terminus. The last two observations are of 
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particular use in manufacturing peptides vaccines: th y indicate where the synthetic peptid s should be 
terminated. 

Description of the Figures 

Figure 1 shows the HTLV-III gp160 envelope protein. The gpl20 and gp41 proteins are shown along 
with recombinant proteins and synthetic peptides referred to In this study. Selected restriction site locations 
are shown on the map at the top along with the shaded leader peptide (1-37) and transmembrane region 
(519-534). The precursor gp160 is cleaved to form the gp120 and gp41 proteins. The dimensions and 
10 locations of the RIO and PB1 recombinant proteins and the env T1 and T2 synthetic peptides examined in 
this study are shown. The locations of known B cell epitopes (81 and 82) in the gp41 protein are also 
shown. 

Figure 2 shows the results of AMPHI analysis in the region of the env T1 and env T2 sites. 

Figure 3 shows lymph node proliferation assays of HTLV-III envelope gp120 and related recombinant 

75 and synthetic peptide antigens. Fi hybrid mice were immunized with either 10 micrograms of the large 
recombinant protein RIO (panels A and 8 labeled NP) or 3 nanomoles of peptide env T1 (panels C and D 
labeled PN) in 50 microliters of complete Freund's adjuvant (DIFCO, Detroit, Ml) at the base of the tail. 
Eight days later the draining inguinal and periaortic lymph nodes were removed and a single cell 
suspension prepared. Assays were set up in quadruplicate with appropriate antigen In 96-well plates with 3 

20 X 10^ cells per well in complete medium consisting of RPMI 1640 with 44% Eagle's- Hanks' amino acid 
medium, 10% fetal bovine serum. 5 x 10"^ M 2-mercaptoethanol, 2mM fresh-frozen L-glutamine (Gibco, 
Grand Island, NY), 100 U/ml penicillin, and 100 micrpg rams/ml streptomycin (Gibco). Plates were Incubated 
for 4 days at 37* in a 5% CO2 incubator, pulsed with 1 microCi of pH]-thymidine (New England Nuclear, 
Boston MA) and harvested 18 hours later onto glass fiber paper. Thymidine incorporation into DNA was 

25 then quantitated by liquid scintillation counting. 

The geometric mean and standard error of the mean for each group were determined and the no 
antigen background subtracted to obtain the delta cpm. The no antigen backgrounds were: A: 21 ,771 ; 8: 
17,844; C: 30,674; D: 29,298. The confidence intervals for the background values are shown at the zero 
position of each vertical axis (n = 8). The panels are scaled according to the magnitude of the PPD positive 

30 control response. 

Figure 4 shows the response of env T1 peptide immune lymph node cells to gp120 and related 
antigens in the independent parental mouse strains. C57BL/6, C3H/HeJ, A.SW, and BALB/c mice were 
immunized with the 16-residue env T1 peptide and lymph node proliferation assays were performed as 
described in the legend to Figure 3. SW1 02 is a peptide representing sperm whale myoglobin residues 

35 102-118. The no antigen (0) and PPD negative and positive controls are shown in the first position of each 
panel. The panels are scaled according to the magnitude of the PPD response. The no antigen back- 
grounds were: 86: 16,334, C3H: 74,253; A.SW: 28.771; 8ALB/c: 34,600. 

Figure 5 shows alpha helical net representation of the env T2 and env T1 sites. This display can be 
thought of as slicing the cylinder of the helix lengthwise down one face, opening and flattening it. There are 

40 3.6 residues per turn of the helix. The hydrophobic residues are shaded. Residues common to both sides 
are boxed. Regions outside of the peptides are shaded according to hydrophobicities of residues in the 
gp120 sequence. 

Detailed Description of Preferred Embodiments of the Invention 

45 

Prediction of T-cell antigenic peptides has important implications for the development of artlfical 
vaccines. Such vaccines are particularly useful in diseases like leprosy, caused by organisms which are 
hard to culture and for which the cellular arm of the immune system Is the principal defense. Even when 
antibody production is the primary goal of vaccination, a secondary or anamnestic response requires the 

50 induction of helper T-cell immunity. Prediction of peptides for use as vaccines requires discovery and 
confirmation of properties correlating with T-cell antigenicity. One of the purposes of this invention is to use 
such properties in a process capable of reliably predicting T-cell stimulation by a protein segment. 

T-celi immunity is not only important In the defense against many viral infections; t-cell help is also 
necessary for a memory antibody response. Howev r, only a limited number of segments of protein 

55 antigens elicits T-cell immunity. None have previously been found for any proteins of the AIDS virus. The T- 
c II antigenic sites from AIDS viral proteins covered by the present invention should be very useful in any 
vaccine developed for this virus. The present invention also shows that one of the peptides elicits T-cell 
immunity to the native AIDS gp120 envelope protein. Thus, these peptides may constitut all or part of a 
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synthetic vaccine. Also, a fragment vaccin made by recombinant DNA or other technology should be 
designed to include these sites. 

The experimental peptides containing the immunodominant sites are defined herein as antigenic sites . 
"Antigenicity" in this invention always refers to T-cell antigenicity. 

5 In vivo, an antigenic protein probably passes through three main steps before raising a helper T-cell 
respohser(a) "processing": an antigen-presenting cell (APC), usually a macrophage, dendritic or B cell, 
ingests the protein and then digests It into smaller peptides; (b) "presentation": these peptides are then 
presented to T-cells, probably in conjunction with a Class II Major Histocompatibility Complex Protein on 
the APC surface; and (c) "recognition": a helper T-cell receptor then recognizes some combination of 

10 peptide and Class tl Protein, and initiates a T-cell response. 

Two antigenic properties are thought to contribute to this process, amphipathlcity and Alpha-helicity, 
based on the findings in this invention. 

A structure is amphipathic when it has both a hydrophobic portion and hydraulic portion, A peptide is 
segmentally amphipathic when the peptide contains at least two disjoint subpeptides, one hydrophobic, the 

75 other hydrophilic. A peptide is alpha-amphlpathic If, when the peptide is put Into an alpha-helical 
conformation, one side of the alpha-helix is hydrophobic, the other side hydrophilic. A peptide is helically 
amphipathic if, when put into an alpha or 3io helix, or similar helical structure, one side of the helix is 
hydrophobic and the other side hydrophilic. Both segmental amphipathlcity and helical-amphipathicity are 
believed to contribute to T-cell antigenicity, though opinions about their relative importance differ. 

20 AMPHI was used to analyze the gp120 envelope protein of the HTLV-lllb isolates of HIV for sequences 
consistent with formation of amphipathic helices as potential T cell sites. Sites were ranked according to the 
apparent strength of helical amphipathlcity as reflected In the Amphipathic Score, and frequencies were 
examined for consistency. Sites were further selected for occurrence In constant regions of gp120 (based 
on a comparison of the sequence of six Isolates) and for absence of N-linked glycosylatlon sites. AMPHI 

25 parameters for the two most favorable sites are shown in Figure 2. Candidate T cell sites were selected by 
including appropriate flanking residues. Candidate T cell sites env T1 and env T2 were defined as residues 
428 through 443 and 112 through 124, respectively. The standard epitope nomenclature employed consists 
of viral isolate, protein designation, site type, and assigned number or residue number (e.g. HTLV-III (bHIO)- 
env T1 ). 

30 Synthetic peptides corresponding to these sites were prepared by solid phase peptide synthesis. 
Peptides were synthesized using standard methods of solid phase peptide synthesis on a Vega 250 peptide 
synthesizer using double dicyclohexylcarbodlimlde mediated couplings. HOBT preactivation couplings were 
' performed when coupling Gin or Asn. The standard t-boc/benzyl amino acid protection strategy was 
employed with side chain protection of the following amino acids: Asp(O-benzyl), Glu(0-ben2yl), Hls(tosyl), 

35 Lys(2-chlorobenzyloxycarbonyl). Ser(benzyl), Trp(formyl), Tyr(2,6-dichlorobenzyl). The extent of coupling 
was monitored using the qualitative ninhydrin test and recoupling performed when less than 99.4% coupling 
was observed. Peptides were cleaved from the resin using the low/high hydrogen fluoride (HF) method (J.P. 
Tam, W.F, Heath, and R.B. Merrifield, J. Am. Chem. Soc, 105, 6442 (1983)). For peptide env T2 standard 
HF cleavage was employed as removal of the tryptophan formyl protecting group was found not to be 

40 required for antigenic activity. Peptides were purified to homogeneity by gel filtration on Biogel P4 in 9% 
formic acid followed by reverse phase HPLC as described previously. Composition was confirmed and 
concentration determined by amino acid analysis. Native gp120 was purified by selective detergent 
extraction and Immunoaffinity chromatography followed by dialysis. The recombinant proteins RIO and PB1 
were produced by cloning restriction fragments Kpnl(5923) to Bgl 11(7197) or Pvu 11(6659) to Bgl 11(7197) 

45 from the BH10 clone of HTLV-lllb into the Repligen Expression Vector (REV) followed by expression in E. 
coli and purification. Protein RIO contains residues 49 through 474 and PB1 residues 294 through 474 of 
the envelope protein. 

As a genetically defined model of an outbred population, there was studied the immune response to 
these proteins In (C57BU6 x C3H/HeJ)Fi and (A.SW x BALB/c)Fi mice (H-^^** and H-2«^ respectively). 
50 This strategy provides for full H-2 expression and complementation In the context of lour different strain 
backgrounds. T cells have been shown to be responsible for the proliferation observed In antigen specific 
lymph node proliferation assays and consequently such assays were employed as a measure of T cell 
immunity. 

Quantities of purified gpl20 availabi pr eluded use in immunization and thus the RIO protein 
56 containing the majority of the gp120 s qu nee in nonglycosylated form was th largest immunogen used. 
Figure 3, panels A and B, shows the lymph node proliferative response of RIO immune mice tested with 
native gp120, recombinant proteins, and synthetic peptide env Tl. In both Fi hybrids a strong response 
was observed not only to the immunog n RIO. but also to gpl20. Therefor . the r sponse was largely 
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directed at envelop sequ nee and not at the irr I vant vector-deriv d flanking sequenc needed r sidu s 
in the reconnblnant protein. Thus the recombinant R10-fragment is an effective immunogen for prinning for a 
response to the native gp12p. The response to the synthetic peptide env T1 indicates that a significant 
component of the T cell response to the 425 residue R10 is in fact focused on the 16 residue env T1 site. 
5 In other experiments with RIO immune lymph node cells a response to peptide env T2 similar to that to 
peptide env T1 was observed (Table I). 



TABLE I 

^0 RESPONSE TO ENV T2 PEPTIDE IN 

R10(49-474)-IRMUNE F. HYBRID MICE 
RELATIVE TO NATIVE gpl20 and env Tl PEPTIDE 



Hybrid Mice 



Ant ieen 


(B6 X C3H)Fi 


(A.SW X BALB/c)Fi 


gpl20 


69,738 


65,949 




(1.01) 


(1.07) 


env Tl 


17,686 


2S, 140 




(1.12) 


(1.15) 


env T2 


20,703 


23,332 




(1.18) 


(1.14) 


med i um 


10,864 


13,381 




(1.09) 


(1.07) 



pH]-thymidine incorporation is shown for each group expressed as the geometric mean counts per minute 
30 with the standard error term for quadruplicate samples shown in parenthises. N = 8 for the medium controls. 
Antigen concentrations were 0.075 micromolar for gp120 and 4.8 micromolar for the env Tl and T2 
peptides. 

Given that immunization with a large fragment spanning most of the gp120 sequence elicits a response 
partially focused on a small site defined by a synthetic peptide (a native immunogen/peptide test-antigen or 

35 NP experiment), we next asked whether immunization with the synthetic peptide would elicit immunity to 
the native protein (a peptide immunogen/native test-antigen or PN experiment). Immunogenicity in the PN 
direction would appear to be a prerequisite for efficiency as a vaccine site. The results of such an 
experiment in the Fi mice is shown in Figure 3, panels C and D. Mice immunized with env Tl peptide 
showed substantial immunity not only to the env T1 immunogen but also to the native gp120 as well as to 

40 the recombinant proteins. Thus a 16-residue synthetic peptide selected on the basis of amphipathicity can 
elicit T cell immunity to the native AIDS virus protein. 

To further characterize genetic restriction of the response to env Tl there was studied the independent 
H-2 disparate parental strains from which the Fi hybrids have been derived: 0576176, C3H/HeJ, A.SW and 
BALB/c. Mice were immunized with env T1 peptide and studied with native and peptide antigens. As shown 

45 in figure 4, C57BI_/6 ( H-2 ^ haplotype) was found be a low responder, whereas the other strains (H-2'', H-2^, 
and haplotypes) were intermediate or high responders to the env T1 peptide. The responselo native 
gp120 paralleled that to the peptide. A corresponding pattern of responsiveness is also observed in 
experiments using H-2 congenic strains of mice (Table II). Thus peptide env Tl represents a 16 residue 
peptide that can prime T cells for a secondary response to the 518 residue glycosylated native gp120 in 

50 multiple MHC haplotypes. 



55 
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TABLE II 

RESPONSE TO gpl20 IN ENV Tl 
PEPTIDE- IMMUNE H-2 CONGENIC MICE 



Congenic Strains 



Ant ieen 


BI0.A(5R) 


BIO. BR 


B10.S(9R) 


610. D2 


PPD 


85,5 11 


100.872 


71,006 


44,564 




(1.06) 


(1.07) 


(l.OS) 


(1.17) 


gpl20 


45,857- 


69,456 


68,219 


64,858 


(1.02) 


(1.05) 


(1.06) 


(1.10) 


med i um 


29,715 


40,639 


22,863 


19,665 




(1.03) 


(1.04) 


(1.04) 


(1.06) 



pH]-thymidine incorporation is shown for each group expressed as the geonnetric nnean counts per minute 
with the standard error term for quadruplicate samples shown in parantheses. N = 8 for the medium controls. 

20 Antigen concentrations were 0.075 micromolar for gp120 and 32 micrograms per milliliter for PPD. 

An unexpected finding was the striking cross reaction between env T1 and env T2 peptides. The env 
Tl immune cells responded to env T2 as well as to the immunizing peptide. Cross reactivity of env T2 was 
most pronounced on the H-2'* haplotype. Prompted by this finding, the two sequences were compared and 
there was a degree of homology which was even more evident when considered in the context of possible 

25 alpha helical structure as shown in Figure 5. Not only do env Tl and env T2 share the hydrophobic lle-lle- 
Xxx-Yyy-Trp cluster on the hydrophobic face and the Lys on the hydrophilic face of the helix, but also the 
spatial relationship between these is identical. Gin and acidic amino acids (Glu, Asp) neighboring the Lys 
are observed in both cases as well. The poor reactivity to peptide 102-118 of sperm whale myoglobin which 
is derived from an unrelated protein and shares minimal homology with env T1 indicates that the property 

30 of being an amphipathic alpha-helical peptide is not sufficient for cross-reactivity. As an additional 
specificity control. gp120, env Tl and env T2 were tested using lymph node cells immune to an unrelated 
antigen, sperm whale myoglobin, and were found to be non-stimulatory (data not shown). 

Though species differences are certain to influence the T cell repertoire, the molecules and mecha- 
nisms leading to a T cell response are conserved across species and thus the factors determining 

35 immunodominance are similar as well. The one helper T cell site (from influenza virus) that has been 
characterized at the synthetic peptide level in man is in fact immunodominant in mice as well and has no 
amino acid sequence consistent with formation of a highly amphipathic alpha helix. 

Analysis of the various viral proteins in the thirteen HIV isolate sequences published shows the 
envelope gene to be the most variable. Such analysis reveals discrete variable and conserved regions. A 

40 polyvalent vaccine might thus be appropriate. Both the env Tl and the env T2 sites are highly conserved in 
the HIV sequences reported to date. The env T2 site is the more highly conserved of the two sites 
especially in the LAVeu isolate. The previously described B cell sites (B1 and B2 in Figure 1) fall within 
conserved regions as well. 

Induction of T cell immunity may contribute in several ways to protection against HIV infection. Though 

45 AIDS progresses despite the presence of detectable antibody to viral proteins in most patients, low titer 
neutralizing antibody has been demonstrated in many such patients. Neutralizing titers are substantially 
higher in healthy AIDS related complex patients and in HIV antibody positive hemophiliacs. Whether this 
relationship is causal or simply correlative is as yet unknown. If these or any other antibodies are in fact 
protective, provision of optimum T cell help at the time of immunization as well as when faced with an 

60 infectious challenge would appear essential. Substantial T cell help should also be required for an effective 
cell mediated response to infected cells. NK cells have been shown to selectively kill HIV infected ceils in 
vitro. Given that a major mode of viral transmission in the infected patient is thought to be cell to cell, I 
vaccine that primes helper T cells for augmentation of NK cell and possibly lymphokine activated killer cell 
activity is essential for an effective vaccine. Induction of virus-specific cytotoxic T lymphocyte (CTL) 

55 immunity, which also requires helper T cells, is also desirabf . While in some cases the d terminants 
recognized by CTL can in fact be defined by small peptides, in vivo expression of antigens such as in a 
vaccinia or adenoviral vector may be required for efficient ellcitation of a classical CTL response. The 
AMPHI algorithm was developed to identify h Iper T cell sites, and cons quently its relevance to CTL 
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specificity Is unknown. How ver, it does successfully id ntify tlie two cliaracteriz d sites in influenza 
nucleoprotein recognized by human and nnurine CTL 

The fact that helper T c N innmunity can be Induced with short peptides as well or better than with 
native protein stands in sharp contast to the situation with B cell innmunity for which tertiary structure Is 

5 frequently important, and indicates that peptide vaccines aimed at T cell immunity are more successful than 
those aimed at antibody production. In a synthetic peptide or recombinant fragment based construct, one 
could selectively include important helper T cell sites, in multiple copies If desired, and exclude suppressor 
T cell sites which can blunt the Immune response. Sites associated with specific functions or possible 
undesirable side effects such as the CD4 binding site(s), the site(s) mediating syncytia formation, or the 

10 neuroleukin homology site, can be systematically included or excluded. For vaccines designed to induce 
antibodies as well as T cell immunity, incorporation of pathogen-derived T cell sites along with important B 
cell sites obviates the need to chemically couple small peptides to Irrelevant carriers and disposes of 
coupling and carrier-derived problems. Thus the T cell sites Identified here are potentially important 
components of an effective AIDS vaccine. 

75 The present invention Includes a method of predicting which segments of a protein (along its entire 
sequence, if desired) are antigenic. In other words, the present invention Is a method of determining which 
sites of an entire protein sequence are recognized by T-cells (activate or stimulate T-cells). Application of 
this method is limited only by knowledge of the amino acid sequences of a protein, i.e., can be applied to 
any protein in the protein data base of the National Biomedical Research Foundation or any protein whose 

20 sequence Is subsequently published. Moreover, the analysis can be done using the amino acid sequence 
translated from a DNA gene sequence, without ever isolating the protein. The background experiments 
which made this process possible comprise, in their entirety, an examination of a number of properties to 
determine if a particular property or properties is Implicated in T-cell stimulation. 

The following properties were determined to be fundamentally important (with a high degree of 

25 significance) in determining the potential immunogeniclty of certain protein sequences: 

a. The helical amphipathicity of segments along the entire sequence of a protein; 

b. the conformational propensity of segments along the entire sequence of the protein; 

c. the presence or absence of helix-breakers In segments along the entire sequence of the protein; and 

d. the presence and location in the protein sequence of amino acid residues which favor T-cell 
30 recognition. 

These properties were used to develop an optimized algorithm for detecting T-cell antigenic sites 
(based on the amphipathic helix model) in a protein with known sequences. The optimum algorithm 
Identifies 18 of 23 known sites (75% sensitivity), with a high degree of significance (p<0.001). The success 
of the algorithm also shows that stable amphipathic structures such as amphipathic helices are fundamen- 
35 tally important in determining immunodominance. The optimized algorithm enables the predication of 
immunodominant T-cell sites on a protein. This prediction capability facilitates the rational design of 
synthetic vaccines, and facilitates other apprroaches to antigen specific T-celi recognition. 

EXAMPLES 

40 

Example 1 

The AMPHI algorithm was used to examine the HTLV-III envelope protein amino acid sequence for sites 
with periodic variation in the hydrophoblcity consistent with formation of an amphipathic helix (alpha or 3io). 

45 Overlapping blocks of 1 1 residues were examined and resultant parameters assigned to the middle residue. 
The results for the residue 100-200 and 400-500 regions encompassing the env T2 and T1 sites 
respectively appear on the left and right of Figure 2. The upper panels show the amphipathic index, a 
measure of intensity of amphipathicity, determined at a frequency of 100* or 120* per residue. The higher 
of the two is shown. The lower panels display the frequency where the maximum amphipathic signal is 

50 observed. The sites were selected based on their consistently high amphipathic indices with maxima in the 
helical frequency range. The amino acid sequence of the indicated sites are: env T2(1 12-124): His-Glu-Asp- 
lle-lle-Ser-Leu-Trp-Asp-Gln-Ser-Leu-Lys; env T1 (428-443): Lys-Gln-lle-lle-Asn-Met-Trp-Gln-Glu-Val-Gly-Lys- 
Ala-Met-Tyr-Ala. 

55 Example 2 

Applying the algorithm is a major step in predicting the most probable immunodominant sites that show 
amphipathic helical potency. The number and length of sites along a specific protein depend on the 
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hydrophobicity profile of that protein. There are proteins that show a high degr e of amphipathic h Ileal 
potency (and contain many pr dieted sites), while others are poor in amphipathic segments. After having 
predicted all the possible amphipathic helical segments, the segments must be graded. The present 
invention prefers the use of three factors for grading purposes; a) amphipathic score (particularly useful 

5 when comparing segments of the same length); b) the rarity of proline in helices in general (except near the 
NH2-terminus), and in most of the helical antigenic sites in particular; and c) the appearance of lysine at the 
carboxyl end in a large number of helical antigenic sites. It has been found that lysine as the ultimate or 
penultimate C-terminal residue occurs much more frequently in immunodominant sites. In short, a preferred 
sequence contains amphipathic segments with proline, if present, only near the N-terminus, the lysine near 

10 the C-terminus. 

Another possible indicator is the presence of N-glycosylation sites-these sites are indicative of a less 
favorable candidate for an . immunodominant site, because the T-cell epitope may be masked by the 
carbohydrate. 

75 Example 3 

Using the process of this invention, the following segments were predicted to be T-cell stimulation sites: 
1) segment 93-127 



20 



25 



PheAsnMetTrpLysAsnAspMetValGluGlnMe tHi sGluAspI lei leSerLeu- 
TrpAspGlnSerLeuLysP roCysValLysLeuThrProLeuCysVal 

2) segment 415-437 



30 ThrLeuProCysArgI l eLysGlnl lei leAsnMetTrpGlnGluValGlyLysAla- 
MetTyrAlaP ro 



36 The numbering sequence used in this invention corresponds to Ratner et al. Nature, 313:277-284 
(1985). 

The underlined portion of each segment has been shown by other methods to be a T-cell stimulation 

site, 
40 Example 4 

The following segments from the envelope region of the HIV genome have been predicted as 
candidates for T-cell stimulation sites: 



45 



50 



segment 231-250 
segment 307-331 
segment 335-357 
segment 553-574 



segment 615-652 
segment 659-681 
segment 777-808 
segment 827-856 



The following segments from the gag region of the HIV genome have been predicted as candidates for T- 
cell stimulation sites: 



segment 5-22 


segm 


nt 148-162 


segment 52-70 


segm 


nt 284-309 


segment 93-110 


segm 


nt 360-376 
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The numbering sequence used in this invention corresponds to Ratner et al, Nature , 313:277-284 
(1985). 

The amino acid sequences for these peptides are shown in Table 3. 

TABLE 3 

From the env protein: 
Segment 93-127 

PheAsnMetTrpLysAsnAspMe tVa IGluGlnMe t Hi sGluAspI lei leSerLeu- 
TrpAspGlnSerLeuLysP roCysValLysLeuThrProLeuCysVal 

Segment 23 1-250 

LysThrPheAsnGlyThrGlyProCysThrAsnValSerThrValGInCysThrHi s- 
Gly 

Segment 307-33 1 

IleArgI leGlnArgGlyProGlyArgAlaPheValThrlleGlyLysI leGlyAsn- 
MetArgGlnAlaHisCys 

Segment 335-357 

ArgAlaLysTrpAsnAsnThrLeuLysGlnl leAspSerLysLeuArgGluGlnPhe- 
GlyAsnAsnLys 

Segment 415-437 

ThrLeuProCysArgI l eLysGlnl lel leAsnMetTrpGlnGluVa IGlyLysAla- 
MetTyrAlaP ro 

Segment 553-574 

AsnAsnLeuLeuArgAlal 1 eGluAlaGlnGlnHi sLeuLeuGlnLeuThrVa ITrp- 
GlylleLys 

Segment 6 15-652 

SerAsnLysSerLeuGluGlnl leTrpAsnAsnMe tThrTrpMetGluTrpAspArg- 
GluI leAsnAsnTyrThrSerLeuI lelii sSerLeul leGluGluSerGlnAsnGln 

Segment 659-68 1 

GluLeuLeuGluLeuAspLysTrpAlaSerLeuTrpAsnTrpPheAsnl leThrAsn- 
TrpLeuTrpTyr 

Segment 777-808 

IleValThrArgI leValGluLeuLeuGlyArgArgGlyTrpGluAlaLeuLysTyr- 
TrpTrpAsnLeuLeuGlnTyrTrpSerGlnGluLeuLys 

Segment 827-856 

AspArgVall leGluValVa IGlnGlyAlaTy rArgAlal leArgHi si leProArg- 
Argl leArgGlnGlyLeuGluArgl leLeuLeu 

From the gag protein: 

Segment 5-22 

AlaSerValLeuSerGlyGlyGluLeuAspArgTrpGluLysI leArgLeuArg 
Segment 52-70 

GluThrSerGluGlyCysArgGlnl leLeuGlyGInLeuGlnProSerLeuGlnThr 
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Segment 93-1 10 

Glul leLysAspThrLysGluAlaLeuAspLysI leGIuGluGluGlnAsnLy s 
Segment 148*162 

SerProArgThrLeuAsnAlaTrpValLysValValGluGluLys 
Segment 284-309 

AspIleArgGlnGlyProLysGluProPheArgAspTyrValAspArgPheTyrLys- 
ThrLeuArgAlaGluGlnAla 

Segment 360*376 

AlaArgValLeuAlaGluAlaMetSerGlnValThrAsnThrAlaThrl le 



Ala 


A 


Alanine 


Leu 


L Leucine 


Arg 


R 


Arginine 


Lys 


K Lysine 


Asn 


N 


Asparagine 


Met 


M Methionine 


Asp 


D 


Aspartic acid 


Phe 


F Phenylalanine 
P Proline 


Cys 


C 


Cysteine 


Pro 


Gin 


Q 


Glutamine 


Ser 


S Ser ine 


Glu 


E 


Glutamic acid 


Thr 


T Threonine 


Gly 


G 


Glyc ine 


Trp 


W Tryptophan 


His 


H 


Hist idine 


Tyr 


Y Tyros ine 


lie 


I 


Isoleucine 


Val 


V Valine 



Claims 

1. Isolated peptides or portions thereof adapted to serve as components of a vaccine for acquired immune 
deficiency syndrome (AIDS), characterized in that said peptides are capable of inducing T cell 
response specific to acquired immunodeficiency syndrome (AIDS) virus and in that said peptides are 
selected from amino acid sequences in single letter code: 



FNMWKNDMVEQMHEDIISLWDQSLKPCVKLTLCV; 
KTFNGTGPCTNVSTVQCTHG; IRIQRGPGRAFVTIGKIGNMRQAHC ; 
RAKWNNTLKQIDSKLREQFGNNK; TLPCRIKQIINMWQEVGKAMYAP; 
NNLLRAIEAQQHLLQLTVWGIK; 
SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ; 
ELLELDKWASLWNWFNITNWLWY ; 
IVTRIVELLGRRGWEALKYWWNLLQYWSQELK; 
DRVIEWQGAYRAIRHIPRRIRQGLERILL; ASVLSGGELDRWEKIRLR; 
ETSEGCRQILGQLQPSLQT; EIKDTKEALDKIEEEQNK ; 

SPRTLNAWVKVVEEK; DIRQGPKEPFRDYVDRFYKTLRAEQA; 
ARVLAEAMSQVTNTATI ; HEDI I SLWDQSLK ; and 
KQIINMWQEVGKAMYA; and 

T-cell sites HTLV-lllb env Ti , residues 428-443, and env T2, residues 1 12-124, or portions thereof. 

2. Synthetic peptides comprising T cell sites HTLV-lllb env TI and env T2 which are residues 428-443 
and 112-124, respectively, utilizing standard epitope nomenclatur . 

3. Synthetic peptides corresponding to the sites of Claim 2. 
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4. Synthetic peptides substantially corresponding to the sites of Claim 2, or portions thereof. 

5. Peptid s according to any preceding clainn wherein said peptides correspond to sequences from HIV 
viral proteins and when Included in a vaccine, are capable of Inducing T-cell immunity to acquired 
Immune deficieny syndrome. 

6. The peptides of Claim 5. wherein said T-cell response Is the Induction of T-cell Immunity. 

7. The peptides of Claim 5, wherein said T-cell response Is the induction of T-cell help for an antibody 
response. 

8. A synthetic peptide capable of eliciting a T-cell response, said peptide substantially corresponding to 
amino acid sequences 428-443 of the HIV envelope protein. 

9. A synthetic peptide capable of eliciting a T-celi response, said peptide substantially corresponding to 
amino acid sequences 112-124 of the HIV envelope protein. 

10. The preparation of synthetic peptides according to the sites of Claim 2, which includes the steps of 
defining peptides by searching for segments which have periodicity of hydrophoblcity properties 
wherein said segments exhibit the capability to form amphlpathic helices which are recognized by'T 
cell antigenic sites, and synthesizing said peptides. 

11. Use of the peptides of Claim 1 in the preparation of a medicament for the treatment of AIDS. 

12. Use of Claim 11, wherein said medicament is a vaccine. 
PatentansprUche 

1. Isolierte Peptide Oder Telle hiervon, die dafUr ausgebildet sind, als Komponenten eines Impfstoffs fGr 
das erworbene Immundefektsyndrom (AIDS) zu dienen, dadurch gekennzeichnet, daB die Peptide In 
der Lage sind, eine speziflsche T-Zellen-lmmunantwort auf den Virus des erworbenen Immundefektsyn- 
droms (AIDS) zu Induzleren, und dafi die genannten Peptide aus Aminosauresequenzen ausgewahit 
sind, die im Einbuchstabencode lauten: 



FNMWKNDMVEQMHEDIISLWDQSLKPCVKLTLCV 
KTFNGTGPCTNVSTVQCTHG; IRIQRGPGRAFVTIGKIGNMRQAHC 
RAKWNNTLKQIDSKLREQFGNNK; TLPCRIKQIINMWQEVGKAMYAP 
NNLLRA IEAQQHLLQLTVWGIK 
SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ 
ELLELDKWASLWNWFNITNWLWY 
IVTRIVELLGRRGWEALKYWWNLLQYWSQELK 
DRVIEWQGAYRAIRHIPRRIRQGLERILL; ASVLSGGELDRWEKIRLR 
ETSEGCRQILGQLQPSLQT; E I KDTKE A LDK I EEEQNK 

SPRTLNAWVKVVEEK; DIRQGPKEPFRDYVDRFYKTLRAEQA 
ARVLAEAMSQVTNTATI ; H ED I I S LWDQS LK ; und 

KQIINMWQEVGKAMYA; unci 



T-Zellen-Mutationsbereiche HTLV-lllb env T1, Reste 428-443, und env T2, Reste 112-124 oder Teile 
hiervon. 

2. Synthetische Peptide mit T-2ellen-Mutatlonsbereichen HTLV-lllb env T1 und env T2, wobei es sich urn 
Reste 428-443 bzw. 112-124 handelt, unter Anwendung der Standard-Epitop nomenklatur. 

3. Synthetische Peptide entsprechend den I\/Iutatlonsbereichen nach Anspruch 2. 
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4. Synthetische Peptide im wesentiichen entsprechend den Mutationsbereichen nach Anspruch 2 oder 
Teilen hiervon. 

5. Peptide nach einem der vorhergehenden AnsprOche, wobei die genannten Peptide Sequenzen von HIV- 
Virusproteinen entsprechen und, wenn sie in einen Innpfstoff eingebracht sind, in der Lage sind. eine T- 
Zellenimnnunitat gegen das erworbene Immundefektsyndrom zu induzieren. 

6. Peptide nach Anspruch 5, wobei die T-Zellen-lnDnnunantwoi1 die Induktion von T-Ze!l8n-lmmunitat ist. 

7. Peptide nach Anspruch 5, wobei die T-Zellen-lmmunantwort die Induktion von T-Zellen-Hilfe fOr eine 
Antikorper-lmmunantwort ist. 

8. Synthetisches Peptid, das in der Lage ist, eine T-Zellen-lmmunantwort herbeizufOhren, wobei das 
genannte Peptid im Wesentiichen den Anninosauresequenzen 428-443 des HIV- HQ II proteins entspricht. 

9. Synthetisches Peptid, das in der Lage ist, eine T-Zellen-lmnriunantwort beizufQhren. wobei das genann- 
te Peptid im wesentiichen den Aminosauresequenzen 112 bis 124 des HIV-HUIIproteins entspricht. 

10. Herstellung Synthetischer Peptide entsprechend den Mutationsbereichen nach Anspruch 2, welches die 
Schritte des Definierens von Peptiden durch Suchen nach Segmenten, die Periodizitat von 
Hydrophobie-Eigenschaften haben, wobei die genannten Segmente die Fahigkeit zeigen. amphipati- 
sche Spiralen zu bilden, die von antigenen T-Zellenbereichen erkannt werden, und des synthetischen 
Herstellens der genannten Peptide umfaBt. 

11. Verwendung der Peptide nach Anspruch 1 bei der Zubereitung eines Medikaments zur AIDS-Behand- 
lung. 

12. Verwendung nach Anspruch 11, wobei das Medikament ein Impfstoff ist. " 
Revendicatlons 

1. Peptides ou parties de peptides, a I'^tat isol6. adaptes pour §tre utilises comme constituants d'un 
vaccin contre le syndrome d'immunod^ficience acquise (SIDA), caract^ris§ en ce que lesdits 
peptides sont susceptibles d'induire une r^ponse des lymphocytes T, sp^cifique au virus du syndrome 
d'immunodeficience acquise, et en ce que lesdits peptides sont choisis parmi les sequences suivantes 
d'amino-acides Identifies selon le code k une lettre : 



FNMWKNDMVEQMHEDIISLWDQSLKPCVKLTLCV 
KTFNGTGPCTNVSTVQCTHG; IRIQRGPGRAFVTIGKIGNMRQAHC 
RAKWNNTLKQIDSKLREQFGNNK; TLPCRIKQIINMWQEVGKAMYAP 
NNLLRAIEAQQHLLQLTVWGIK 
SNKSLEQIWNNMTWMEWDREINNYTSLIHSLIEESQNQ 
ELLELDKWASLWNWFNITNWLWY 
IVTRIVELLGRRGWEALKYWWNLLQYWSQELK 
DRVIEWQGAYRAIRHIPRRIRQGLERILL ; AS VLSGGELDRWEKIRLR 
ETSEGCRQILGQLQPSLQT; EIKDTKEALDKIEEEQNK 
SPRTLNAWVKVVEEK; DIRQGPKEPFRDYVDRFYKTLRAEQA 
ARVLAEAMSQVTNTATI; H ED I I S LWDQS LK ; et 

KQIINMWQEVGKAMYA; 



et parmi les fragments de HTLV-lllb, 428-443, dits env Ti. et 112-124, dits env T2, constituant des sites 
de combinaison avec des lymphocytes T, ou des parti s d sdits fragm nts. 

2. Peptides de synthase comprenant les segments de HTLV-lllb. constituant des sites de combinaison 
avec des lymphocytes T, dits env Ti et env T2, qui sont les fragments 428-443 et 112-124 
respectivement, selon la nomenclature standard des Epitopes. 
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3. Peptides de synthase corr spondant aux sites de combinaison selon la r vendication 2, ou parties 
desdits sites. 

4. Peptides de synthase correspondant substantiellement aux sites de combinaison selon la revendication 
2. ou parties desdits sites. 

5. Peptides selon I'une quelconque des revendications pr^c^dentes, selon lesquels lesdtts peptides 
correspondent h des sequences appartenant h des prot^ines du virus HIV. et susceptibies. lorsqu'in- 
clues dans un vaccin, d'induire une imnnunit^ des lynnphocytes T vis-^-vIs du syndronne de Timmuno- 
d^ficience acquise. 

6. Peptides selon la revendication 5, selon lesquels ladite r^ponse des lymphocytes T est I'induction 
d*une immunity des lymphocytes T. 

7. Peptides selon la revendication 5, selon lesquels ladite r^ponse des lymphocytes T est I'induction des 
lymphocytes T auxiliaires en vue d'une r^ponse avec un anticorps. 

8. Peptide de synthase susceptible d'induire une r^ponse des lymphocytes T. ledit peptide correspondant 
substantiellement aux sequences 428-443 d'aminoacldes de la prot^ine d'enveloppe de HIV. 

9. Peptide de synthese susceptible d'induire une r^ponse des lymphocytes T, ledit peptide correspondant 
substantiellement aux sequences 112-124 d'aminoacldes de la prot^ine d'enveloppe de HIV. 

10. Preparation de peptides de synthase correspondant aux sites de combinaison selon la revendication 2, 
comprenant les Stapes sulvantes : 

- determination des peptides en recherchant les segments pr^sentant p^riodiquement un caract^re 
hydrophobe, et manifestant la possibility de se conformer en helices amphiphiles, reconnues par 
des sites de combinaison antigenique des lymphocytes T 

- et synthase desdits peptides. 

11. Utilisation des peptides selon la revendication 1, pour preparer un medicament pour le traitement du 
SIDA. 

12. Utilisation selon la revendication 1, selon laquelle ledit medicament est un vaccin. 
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A. NP (B6xC3H)F, B . NP lA.SW x BALB/c)F, 




C. PN (B6 X C3H)F, D. PN (A.SW x BALB/c)F, 




ANTIGEN CONCENTRATION (nM) 



Fig. 3 
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